CLOSET : An E cient Algorithm for Mining Frequent ClosedItemsets
نویسندگان
چکیده
Association mining may often derive an undesirably large set of frequent itemsets and association rules. Recent studies have proposed an interesting alternative: mining frequent closed itemsets and their corresponding rules, which has the same power as association mining but substantially reduces the number of rules to be presented. In this paper, we propose an eecient algorithm, CLOSET, for mining closed itemsets, with the development of three techniques: (1) applying a compressed, frequent pattern tree FP-tree structure for mining closed itemsets without candidate generation, (2) developing a single preex path compression technique to identify frequent closed itemsets quickly, and (3) exploring a partition-based projection mechanism for scalable mining in large databases. Our performance study shows that CLOSET is eecient and scalable over large databases, and is faster than the previously proposed methods.
منابع مشابه
CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
Association mining may often derive an undesirably large set of frequent itemsets and association rules. Recent studies have proposed an interesting alternative: mining frequent closed itemsets and their corresponding rules, which has the same power as association mining but substantially reduces the number of rules to be presented. In this paper, we propose an e cient algorithm, CLOSET, for mi...
متن کاملE cient Vertical Mining of Frequent Closures and Generators
The e ective construction of many association rule bases requires the computation of both frequent closed and frequent generator itemsets (FCIs/FGs). However, only few miners address both concerns, typically by applying levelwise breadthrst traversal. As depthrst traversal is known to be superior, we examine here the depthrst FCI/FG-mining. The proposed algorithm, Touch, deals with both tasks s...
متن کاملTGC-Tree: An Online Algorithm Tracing Closed Itemset and Transaction Set Simultaneously
Finding Association Rules is a classical data mining task. The most critical part of Association Rules Mining is finding the frequent itemsets in the database. Since the introduce of the famouse Apriori algorithm [14], many others have been proposed to find the frequent itemsets. Among all the algorithms, the approach of mining closed itemsets has arisen a lot of interests in data mining commun...
متن کاملDiscovery of Frequent Patterns in Large Data Collections
Data mining or knowledge discovery in databases aims at nding useful reg ularities in large data sets Interest in the eld is motivated by the growth of computerized data collections and by the high potential value of patterns discovered in those collections For instance bar code readers at supermar kets produce extensive amounts of data about purchases An analysis of this data can reveal useful...
متن کاملEvoMiner: Frequent Subtree Mining in Phylogenetic Databases Technical Report #11-08, Dept. of Computer Science, Iowa State University
The problem of mining collections of trees to identify common patterns, called frequent subtrees (FSTs), arises often when trying to make sense of the results of phylogenetic analysis. FST mining generalizes the well-known maximum agreement subtree problem. Here we present EvoMiner, a new algorithm for mining frequent subtrees in collections of phylogenetic trees. EvoMiner is an Apriori-like le...
متن کامل